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Abstract 

This paper is placed at the intersection-point between the study of theoretical computational models aimed 
at capturing the essence of genetic regulatory networks and the field of Artificial Embryology (or Computational 
Development). A model is proposed, with the objective of providing an effective way to generate arbitrary forms 
by using evolutionary-developmental techniques. Preliminary experiments have been performed. 



1 Related Work 



One of the oldest computational models of development is perhaps represented by L- Systems, proposed by Lin- 
denmayer ((Lindenmayer , 1968| )). L-Systems capture the notion of cell signalling and were proposed to model plant 
growth. Rewriting or Production Systems (see (Dittrich et al. , 2001 )) consist of certain entities or symbols ("chemical 
compounds") and a set of syntactic rules for performing replacements ("chemical reactions"). Examples of P-systems 
are CHAM (Chemical Abtract Machine) and ARMS (Abstract Rewriting Systems on Multisets). Random Boolean 
Networks (RBN's) were originally developed by S. Kaufmann as a model of genetic regulatory networks ( ( [Kauffman] 
[l969J ). In the context of the development of multi-cellular organisms, the attractors of RBN's (i.e. the stable states) 
are interpreted as the different "cell types" of the organism. Torsten Reil (Reil, 1999) developed a model called 



"artificial genome" with biologically-plausible properties and studied the dynamics of gene expression. The model 
differs from other approaches, such as RBN's, in that it is entirely based on template-matching in a nucleotide-like 



sequence. Hugo de Garis (De Garis, 1999) developed a model for evolving convex and non-convex shapes in 2D 



reproductive cellular automata. Two genes, which were called operons were used to regulate the growth of each of 
the arms of the shape. The model was successful in evolving convex shapes but the non-convex shapes (L-shape) 
presented a problem. Miller (Miller 2003) developed artificial organisms (the french flag) based on Cartesian Genetic 
Programming which is an extension of Boolean networks. Miller's goal is to evolve a developmental program inside 
a cell to create multicellular organisms. 



2 Introduction and Cell Variables 

We represented the phenotype as a 2-dimensional grid, each pixel representing a single cell, the set of all cells being 
the organism. Cell's variables are broken down into four categories: spacetime memory, genetic memory, epigenetic 
memory, temporary memory or substance space (SUSP): figure 1 lists all variables. Through the interplay of these 
variables, at each cycle a given cell can proliferate, can undergo apoptosis, and can express its phenotype (represented 
by a color). 

3 The Cell Cycle, Development Phase 

At each BC value, each cell of the organism carries out the cell cycle, which is divided in two phases: development 
phase and expression phase, (see figure 2). During the development phase, the cell variables BC (body clock) and 
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Figure 1: Cell variables. 



TM (type marker) are fed to DGM to yield the developmental behaviour of the cell. The cell can either proliferate 
in a rectangle containing the (mother) cell and whose coordinates are specified in DGM, undergo apoptosis (i.e. 
disappear from the grid) or do nothing. If the cell "chooses" to proliferate, it can do it in two distinct "modes": 
growth mode and differentiation mode. In the growth mode the newly created cells are assigned the TM value of the 
mother; in the differentiation mode each newly created cell is assigned a new value of TM (and TM is incremented 
by one at each new assignment). At the beginning a single cell is put in the grid with TM=0 (zygote). DGM (see 
figure 3) is structured as a list of "development operators". Each operator has a BC value and a TM value: these are 
the values for which the operator gets activated. Another field holds a "master switch" that defines the type of event 
that is going to occur: growth- mode proliferation (value=0), differentiation-mode proliferation (value=l), apoptosis 
(value=2). The last field contains the coordinates of the north-west and south-east vertices of the proliferation 
rectangle 

The different types of development operators correspond to a painter's tools: the rectangle proliferation coresponds 
to the brush, the apoptosis corresponds to the eraser, the differentiation mode serves to "mark" individual cells 
(through the assignment of different TM values) fur further processing in subsequent stages. The presence of an 
explicit epigenetic memory, which enables the "cell tracking" , costitutes the most relevant feature of our method. 

4 The Cell Cycle, Expression Phase 

During the expression phase, the substance space SUSP is processed through EGM and the result is used to determine 
the cell phenotype (represented as a color). EGM is structured as an array of expression operators and SUSP is 
structured as an array of operands (artificial counterpart of biological chemical substances), that can be broken down 
into three categories: 

• operands that are exhanged with other cells (input operands) 

• operands that determine the phenotype (output operands) 

• operands that are neither inputs nor outputs (hidden operands) 

At the beginning of the expression phase, the input part of SUSP is initialized (the SUSPs of surrounding cells 
are mapped onto it). Then each operator of EGM "binds" its operands in SUSP and carries out its operation, 
updating SUSP as a result. This sequence of operations is repeated n times (n is a parameter). At the end the 
phenotype of the cell (color) is determined from the value of the output operands. An operand (opd) has two fields, 
one (opd. lab) holding the operand's "label" (corresponding to the chemical "type"), the other (opd.val) holding 
the operand's "value" (corresponding to the chemical concentration). The structure of the operator (oxr) is more 
complicated. The first field (oxr. inputs) is composed of n (n is the arity of the operator) sub-fields, each corresponding 
to a different input (oxr.inputs[i]). Each sub-field is composed of an operand (inputs [i]. opd) and two parameters to 
process that operand, one for each operation's step. The second field is composed of two (output) labels, oxr.outlabO 
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Figure 2: The cell cycle (genetic memory in black, epigenetic memory in grey, temporary memory in white dotted)) 
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Figure 3: Development Genetic Memory)) 



and oxr.outlabl, the third by the output operand (oxr.outopd). An operation (see figure 4) is carried out in two 
steps. The first step determines the value part of the output (oxr.outopd.val), the second step determines the label 
part of the output (oxr.outopd. lab), choosing between two possible labels, oxr.outlabO and oxr.outlabl. 

The resulting computation fiow is similar to that of a neural network, similar to the model proposed by Geard 
([4]), with the difference that, in this case, we have an alternation of layers of "fixed" connections and layers of 
"variable" connections (the variability being determined by the possibility of choosing between two output labels). 
For a given clock value and for a given type marker value, only a subset of expression operators are active, being this 
subset specified by special development operators. 



5 Biological Interpretation and Motivations of the Model 

DGM and EGM correspond to the chromosome (s), and the development and expression operators correspond to the 
individual genes. The epigenetic memory corresponds to the biological epigenetic memory, implemented in real cells 
through the methylation of genes that inhibits their trascription. The operands in the substance space correspond 
to RNA pieces and proteins participating in regulatory pathways during the interphase. The possibility of choosing 
on-the-fiy between two labels for the output operand is equivalent to the conditional branching feature and has its 
biological counterpart in the alternative splicing mechanism, i.e. the possibility that more proteins can be read out 
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Figure 4: Expression Phase Operation)) 



from a single gene. Our model's cell cycle differs from the biological one in that in the development phase a cell 
can proliferate over a rectangular area, which in real cells can be achieved only through a series of mitoses, each 
belonging to a distinct cell cycle. The aspects we are interested to model are those related to the shaping of an 
organism, achieved in the living world through the mitosis, differentiation and apoptosis of individual cells. We 
decided to separate these functions into a development part, that provides the overall "shape" of the organism, and 
an expression part, that provides the phenotype of individual cells. The development part is modeled at a relatively 
high level (a single development operator can in theory make a cell cover the entire grid), in order to speed-up the 
"painting" process. The source of the different behaviour of a cell is provided in the development phase by the 
variable TM (type marker), and in the expression phase by the variable TM (that determines the activation pattern 
of the expression genes) and the behaviour of surrounding cells (that provides the initial state of SUSP). We believe 
that the surrounding environment alone is not sufficient to explain the different behavioural patterns observed in 
real biological systems. 

6 Implementations 

Preliminary experiments have been performed, for the moment only limited to the development part (creation of 
colorless shapes). These experiments were aimed at evaluating the capability of the model to "paint" in an effective 
way, i.e. to evolve shapes. The algorithm was able to successfully evolve some simple shapes, including non-convex 
ones. Further experiments are under way. These first attempts have shown the emergence of a "painting technique", 
consisting in a fast differentiation in the first cycles, followed by touch- ups in the subsequent cycles. The genetic 
algorithm "understands" that the first thing to do is to create a relatively big mass of cells, marking each of them 
with a different type marker. This is the only means by which it can later pick individual cells and commit them to 
a specific fate. Examples of target shapes are reported below (in black), with the best evolved solution (in color). 
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Figure 5: The L and the beetle. 
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Figure 6: Development sequence of the plane. 



Figure 7: The house. 



Figure 8: The map of Italy. 
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